Customizing And Evaluating A Multilingual Discourse Module
نویسنده
چکیده
In this papeh we first describe how we have customized our data-driven multilingu~fl discourse module within our text understanding system lor dill'erent lm~guages and for a particular NLP application by utilizing hierm'chic~dly organized discourse KB's. Then, we report qum~titalive and qmditative findings from ewduating the system both with and without discourse processing, ~md discuss how resolving certain kinds of mmphora affects system perh)r-
منابع مشابه
A Language-Independent Anaphora Resolution System for Understanding Multilingual Texts
This paper describes a new discourse module within our multilingual NLP system. Because of its unique data-driven architecture, the discourse module is language-independent. Moreover, the use of hierarchically organized multiple knowledge sources makes the module robust and trainable using discourse-tagged corpora. Separating discourse phenomena from knowledge sources makes the discourse module...
متن کاملMultiDPS - A multilingual Discourse Processing System
1 This paper presents an adaptable online Multilingual Discourse Processing System (MultiDPS), composed of four natural language processing tools: named entity recognizer, anaphora resolver, clause splitter and a discourse parser. This NLP Meta System allows any user to run it on the web or via web services and, if necessary, to build its own processing chain, by incorporating knowledge or reso...
متن کاملPragmatic Annotation of Discourse Markers in a Multilingual Parallel Corpus (Arabic- Spanish-English)
Discourse structure and coherence relations are one of the main inferential challenges addressed by computational pragmatics. The present study focuses on discourse markers as key elements in guiding the inferences of the statements in natural language. Through a rule-based approach for the automatic identification, classification and annotation of the discourse markers in a multilingual parall...
متن کاملMotivational Determinants of Code-Switching in Iranian EFL Classrooms
“Code-Switching”, an important issue in the field of both language classroom and sociolinguistics, has been under consideration in investigations related to bilingual and multilingual societies. First proposed by Haugen (1956) and later developed byGrosjean (1982), the termcode-switching refers to language alternation during communication. Although code-switching is unavoidable in bilingual and...
متن کاملTowards Using Machine Translation Techniques to Induce Multilingual Lexica of Discourse Markers
Discourse markers are universal linguistic events subject to language variation. Although an extensive literature has already reported language specific traits of these events (e.g. [6,7,4,3,9]), little has been said on their cross-language behavior and, subsequently, on building an inventory of multilingual lexica of discourse markers. Thus, this work describes new methods and approaches for t...
متن کامل